Bayesian parameter estimation for automatic annotation of gene functions using observational data and phylogenetic trees
Authors
Abstract
Gene function annotation is important for a variety of downstream analyses of genetic data. But experimental characterization of function remains costly and slow, making computational prediction an important endeavor. Phylogenetic approaches to prediction have been developed, but implementation of a practical Bayesian framework for parameter estimation remains an outstanding challenge. We developed a computationally efficient model of the evolution of gene annotations using phylogenies, based on a Bayesian framework using Markov Chain Monte Carlo for parameter estimation. Unlike previous approaches, our method is able to estimate parameters over many different phylogenetic trees and functions. The resulting parameters agree with biological intuition, such as the increased probability of function change following gene duplication. The method performs well on leave-one-out cross-validation, and we further validated some of the predictions in the scientific literature.
Similar resources
Automatic estimation of regularization parameter by active constraint balancing method for 3D inversion of gravity data
Gravity data inversion is one of the important steps in the interpretation of practical gravity data. The inversion result can be obtained by minimizing the Tikhonov objective function. Determining an optimal regularization parameter is highly important in gravity data inversion. In this work, an attempt was made to use the active constraint balancing (ACB) method to select the...
Bayesian Estimation of Shift Point in Shape Parameter of Inverse Gaussian Distribution Under Different Loss Functions
In this paper, a Bayesian approach is proposed for shift point detection in an inverse Gaussian distribution. In this study, the mean parameter of the inverse Gaussian distribution is assumed to be constant and shift points in the shape parameter are considered. First, the posterior distribution of the shape parameter is obtained. Then the Bayes estimators are derived under a class of priors and using variou...
Bayesian Models for Phylogenetic trees
Introduction: Inferring the genetic ancestry of different species is a current challenge in phylogenetics because of the immense amount of raw biological data to be analyzed. Computational techniques are necessary in order to parse and analyze all of this data in an efficient but accurate way, with many algorithms based on statistical principles designed to provide a best estimate of a phylogenetic topology....
Bayesian estimation of concordance among gene trees.
Multigene sequence data have great potential for elucidating important and interesting evolutionary processes, but statistical methods for extracting information from such data remain limited. Although various biological processes may cause different genes to have different genealogical histories (and hence different tree topologies), we also may expect that the number of distinct topologies am...
Fuzzy Neighbor Voting for Automatic Image Annotation
With the rapid development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of the most challenging fields in image processing. Automatic image annotation (AIA) refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
Journal
Journal title: PLOS Computational Biology
Year: 2021
ISSN: 1553-734X, 1553-7358
DOI: https://doi.org/10.1371/journal.pcbi.1007948